Extending automatic transcripts in a unified data representation towards a prosodic-based metadata annotation and evaluation

نویسندگان

چکیده

This paper describes a framework that extends automatic speech transcripts in order to accommodate relevant information coming from manual transcripts, the signal itself, and other resources, like lexica. The proposed automatically collects, relates, computes, stores all together self-contained data source, making it possible easily provide wide range of interconnected suitable for analysis, training, evaluating number processing tasks. main goal this is integrate different linguistic paralinguistic layers knowledge more complete view their representation interactions several domains languages. chain composed two stages, where first consists integrating annotations recognition data, second further enriching previous output prosodic information. described has been used identification analysis structural metadata transcripts. Initially put use detection punctuation marks capitalization recovery also recently studying characterization disfluencies speech. It was already applied Portuguese corpora, English Spanish Broadcast News corpora.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Metadata Enrichment for Automatic Data Entry Based on Relational Data Models

The idea of automatic generation of data entry forms based on data relational models is a common and known idea that has been discussed day by day more than before according to the popularity of agile methods in software development accompanying development of programming tools. One of the requirements of the automation methods, whether in commercial products or the relevant research projects, ...

متن کامل

task-based language teaching in iran: a mixed study through constructing and validating a new questionnaire based on theoretical, sociocultural, and educational frameworks

جنبه های گوناگونی از زندگی در ایران را از جمله سبک زندگی، علم و امکانات فنی و تکنولوژیکی می توان کم یا بیش وارداتی در نظر گرفت. زبان انگلیسی و روش تدریس آن نیز از این قاعده مثتسنی نیست. با این حال گاهی سوال پیش می آید که آیا یک روش خاص با زیر ساخت های نظری، فرهنگی اجتماعی و آموزشی جامعه ایرانی سازگاری دارد یا خیر. این تحقیق بر اساس روش های ترکیبی انجام شده است.پرسش نامه ای نیز برای زبان آموزان ...

Towards a Unified View of Design Data and Knowledge Representation

Adequate information modeling in non-standard application areas (e.g. engineering applications such as CAD/CAM, VLSI design or knowledgebased applications) requires the abstraction concepts of classification, aggregation, generalization, and association. The Molecule-Atom Data model (MAD) designed for the effective support of such an information model is justified and described with its essenti...

متن کامل

ONEMercury: Towards Automatic Annotation of Environmental Science Metadata

The rapid growth of diverse data types and greater volumes available to environmental sciences prompts the scientists to seek knowledge in data from multiple places, times, and scales. To facilitate such need, ONEMercury has recently been implemented as part of the DataONE project to serve as a portal for accessing environmental and observational data across the globe. ONEMercury harvests metad...

متن کامل

Automatic structural metadata identification based on multilayer prosodic information

This paper discriminates different types of structural metadata in transcripts of university lectures: boundary events (comma, full stops and interrogatives), and disfluencies (repair). The disambiguation process is based on predefined multilayered linguistic information and on its hierarchical structure. Since boundary events may share similar linguistic properties, in terms of f0 and energy s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Speech Sciences

سال: 2021

ISSN: ['2236-9740']

DOI: https://doi.org/10.20396/joss.v2i2.15035